>AB025413 ACCESSION: AB025413 NID: gi 4760781 dbj AB025413.1 Mus
musculus mRNA for Ten-m4, complete cds
Length = 8585 

Score = 5617 bits (14410), Expect = 0.0 

Identities = 2673/2771 (96%), Positives = 2713/2771 (97%), Gaps = 18/2771 (0%) 
Frame = +2 

Query: 1 MDVKERKPYRSLTRRRDAERRYTSSSADSEEGKAPQKSYSSSETLKAYDQDARLAYGSRV 60 

MDVKERKPYRSLTRRRDAERRYTSSSADSEEGK PQKSYSSSETLKAYDQDARLAYGSRV 
Sbjct: 188 MDVKERKPYRSLTRRRDAERRYTSSSADSEEGKGPQKSYSSSETLKAYDQDARLAYGSRV 367 

Query: 61 KDIVPQEAEEFCRTGANFTLRELGLEEVTPPHGTLYRTDIGLPHCGYSMGAGSDADMEAD 120

KD+VPQEAEEFCRTG NFTLRELGL E+TPPHGTLYRTDIGLPHCGYSMGA SDAD+EAD
Sbjct: 368 KDMVPQEAEEFCRTGTNFTLRELGLGEMTPPHGTLYRTDIGLPHCGYSMGASSDADLEAD 547

Query: 121 TVLSPEHPVRLWGRSTRSGRSSCLSSRANSNLTLTDTEHENTETDHPGGLQNHARLRTPP 180

TVLSPEHPVRLWGRSTRSGRSSCLSSRANSNLTLTDTEHENTETDHP LQNH RLRTPP
Sbjct: 548 TVLSPEHPVRLWGRSTRSGRSSCLSSRANSNLTLTDTEHENTETDHPSSLQNHPRLRTPP 727

Query: 181 PPLSHAHTPNQHHAASINSLNRGNFTPRSNPSPAPTDHSLSGEPPAGGAQEPAHAQENWL 240

PPL HAHTPNQHHAASINSLNRGNFTPRSNPSPAPTDHSLSGEPPAG AQEP HAQ+NWL 
Sbjct: 728 PPLPHAHTPNQHHAASINSLNRGNFTPRSNPSPAPTDHSLSGEPPAGSAQEPTHAQDNWL 907 

Query: 241 LNSNIPLETRNLGKQPFLGTLQDNLIEMDILGASRHDGAYSDGHFLFKPGGTSPLFCTTS 300

LNSNIPLETRNLGKQPFLGTLQDNLIEMDIL ASRHDGAYSDGHFLFKPGGTSPLFCTTS
Sbjct: 908 LNSNIPLETRNLGKQPFLGTLQDNLIEMDILSASRHDGAYSDGHFLFKPGGTSPLFCTTS 1087

Query: 301 PGYPLTSSTVYSPPPRPLPRSTFARPAFNLKKPSKYCNWKCAALSAIVISATLVILLAYF 360

PGYPLTSSTVYSPPPRPLPRSTF+RPAFNLKKPSKYCNWKCAALSAI+ISATLVILLAYF
Sbjct: 1088 PGYPLTSSTVYSPPPRPLPRSTFSRPAFNLKKPSKYCNWKCAALSAILISATLVILLAYF 1267

Query: 361 VAMHLFGLNWHLQPMEGQM--YEITEDTASSWPVPTDVSLYPSGGTGLETPDRKGKGTTE 418 

VAMHLFGLNWHLQPMEGQM YEITEDTASSWPVPTDVSLYPSGGTGLETPDRKGKG E 
Sbjct: 1268 VAMHLFGLNWHLQPMEGQMQMYEITEDTASSWPVPTDVSLYPSGGTGLETPDRKGKGAAE 1447 

Query: 419 GKPSSFFPEDSFIDSGEIDVGRRASQKIPPGTFWRSQVFIDHPVHLKFNVSLGKAALVGI 478 

GKPSS FPEDSFIDSGEIDVGRRASQKIPPGTFWRSQVFIDHPVHLKFNVSLGKAALVGI 
Sbjct: 1448 GKPSSLFPEDSFIDSGEIDVGRRASQKIPPGTFWRSQVFIDHPVHLKFNVSLGKAALVGI 1627 

Query: 479 YGRKGLPPSHTQFDFVELLDGRRLLTQEARSLEGTPRQSRGTVPPSSHETGFIQYLDSGI 538

YGRKGLPPSHTQFDFVELLDGRRLLTQEARSLEG RQSRG VPPSSHETGFIQYLDSGI 
Sbjct: 1628 YGRKGLPPSHTQFDFVELLDGRRLLTQEARSLEGPQRQSRGPVPPSSHETGFIQYLDSGI 1807 

Query: 539 WHLAFYNDGKESEVVSFLTTAIESVDNCPSNCYGNGDCISGTCHCFLGFLGPDCGRASCP 598

WHLAFYNDGKESEVVSFLTTAIESVDNCPSNCYGNGDCISGTCHCFLGFLGPDCGRASCP
Sbjct: 1808 WHLAFYNDGKESEVVSFLTTAIESVDNCPSNCYGNGDCISGTCHCFLGFLGPDCGRASCP 1987

Query: 599 VLCSGNGQYMKGRCLCHSGWKGAECDVPTNQCIDVACSNHGTCITGTCICNPGYKGESCE 658

VLCSGNGQYMKGRCLCHSGWKGAECDVPTNQCIDVACS+HGTCI GTCICNPGYKGESCE
Sbjct: 1988 VLCSGNGQYMKGRCLCHSGWKGAECDVPTNQCIDVACSSHGTCIMGTCICNPGYKGESCE 2167

Query: 659 EVDCMDPTCSGRGVCVRGECHCSVGWGGTNCETPRATCLDQCSGHGTFLPDTGLCSCDPS 718 

EVDCMDPTCS RGVCVRGECHCSVGWGGTNCETPRATCLDQCSGHGTFLPDTGLC+CDPS 
Sbjct: 2168 EVDCMDPTCSSRGVCVRGECHCSVGWGGTNCETPRATCLDQCSGHGTFLPDTGLCNCDPS 2347 






Query: 719 WTGHDCSIEICAADCGGHGVCVGGTCRCEDGWMGAACDQRACHPRCAEHGTCRDGKCECS 778 

WTGHDCSIEICAADCGGHGVCVGGTCRCEDGWMGAACDQRACHPRCAEHGTCRDGKCECS
Sbjct: 2348 WTGHDCSIEICAADCGGHGVCVGGTCRCEDGWMGAACDQRACHPRCAEHGTCRDGKCECS 2527

Query: 779 PGWNGEHCTI---------EGCPGLCNGNGRCTLDLNGWHCVCQLGWRGAGCDTSMETAC 829

PGWNGEHCTI EGCPGLCNGNGRCTLDLNGWHCVCQLGWRG GCDTSMET C

Sbjct: 2528 PGWNGEHCTIAHYLDRVVKEGCPGLCNGNGRCTLDLNGWHCVCQLGWRGTGCDTSMETGC 2707

Query: 830 GDSKDNDGDGLVDCMDPDCCLQPLCHINPLCLGSPNPLDIIQETQVPVSQQNLHSFYDRI 889 

GD KDNDGDGLVDCMDPDCCLQPLCH+NPLCLGSP+PLDIIQETQ PVSQQNL+ FYDRI 
Sbjct: 2708 GDGKDNDGDGLVDCMDPDCCLQPLCHVNPLCLGSPDPLDIIQETQAPVSQQNLNPFYDRI 2887 

Query: 890 KFLVGRDSTHIIPGENPFDGGHACVIRGQVMTSDGTPLVGVNISFVNNPLFGYTISRQDG 949 

KFLVGRDSTH IPGENPFDGGHACVIRGQVMTSDGTPLVGVNISF+NNPLFGYTISRQDG 
Sbjct: 2888 KFLVGRDSTHSIPGENPFDGGHACVIRGQVMTSDGTPLVGVNISFINNPLFGYTISRQDG 3067 

Query: 950 SFDLVTNGGISIILRFERAPFITQEHTLWLPWDRFFVMETIIMRHEENEIPSCDLSNFAR 1009

SFDLVTNGGISIILRFERAPFITQEHTLWLPWDRFFVMETI+MRHEENEIPSCDLSNFAR
Sbjct: 3068 SFDLVTNGGISIILRFERAPFITQEHTLWLPWDRFFVMETIVMRHEENEIPSCDLSNFAR 3247

Query: 1010 PNPVVSPSPLTSFASSCAEKGPIVPEIQALQEEISISGCKMRLSYLSSRTPGYKSVLRIS 1069

PNPVVSPSPLTSFASSCAEKGPIVPEIQALQEEI I+GCKMRLSYLSSRTPGYKSVLRIS
Sbjct: 3248 PNPVVSPSPLTSFASSCAEKGPIVPEIQALQEEIVIAGCKMRLSYLSSRTPGYKSVLRIS 3427

Query: 1070 LTHPTIPFNLMKVHLMVAVEGRLFRKWFAAAPDLSYYFIWDKTDVYNQKVFGLSEAFVSV 1129 

LTHPTIPFNLMKVHLMVAVEGRLFRKWFAAAPDLSYYFIWDKTDVYNQKVFG SEAFVSV 
Sbjct: 3428 LTHPTIPFNLMKVHLMVAVEGRLFRKWFAAAPDLSYYFIWDKTDVYNQKVFGFSEAFVSV 3607

Query: 1130 GYEYESCPDLILWEKRTTVLQGYEIDASKLGGWSLDKHHALNIQSGILHKGNGENQFVSQ 1189 

GYEYESCPDLILWEKRT VLQGYEIDASKLGGWSLDKHHALNIQSGILHKGNGENQFVSQ
Sbjct: 3608 GYEYESCPDLILWEKRTAVLQGYEIDASKLGGWSLDKHHALNIQSGILHKGNGENQFVSQ 3787 

Query: 1190 QPPVIGSIMGNGRRRSISCPSCNGLADGNKLLAPVALTCGSDGSLYVGDFNYIRRIFPSG 1249 

QPPVIGSIMGNGRRRSISCPSCNGLADGNKLLAPVALTCGSDGSLYVGDFNYIRRIFPSG 
Sbjct: 3788 QPPVIGSIMGNGRRRSISCPSCNGLADGNKLLAPVALTCGSDGSLYVGDFNYIRRIFPSG 3967 

Query: 1250 NVTNILEL-------SHSPAHKYYLATDPMSGAVFLSDSNSRRVFKIKSTVVVKDLVKNS 1302

NVTNILE+ SHSPAHKYYLATDPMSGAVFLSD+NSRRVFK+KST VVKDLVKNS

Sbjct: 3968 NVTNILEMRNKDFRHSHSPAHKYYLATDPMSGAVFLSDTNSRRVFKVKSTTVVKDLVKNS 4147

Query: 1303 EVVAGTGDQCLPFDDTRCGDGGKATEATLTNPRGITVDKFGLIYFVDGTMIRRIDQNGII 1362

EVVAGTGDQCLPFDDTRCGDGGKATEATLTNPRGITVDKFGLIYFVDGTMIRR+DQNGII
Sbjct: 4148 EVVAGTGDQCLPFDDTRCGDGGKATEATLTNPRGITVDKFGLIYFVDGTMIRRVDQNGII 4327

Query: 1363 STLLGSNDLTSARPLSCDSVMDISQVHLEWPTDLAINPMDNSLYVLDNNVVLQISENHQV 1422 

STLLGSNDLTSARPLSCDSVM+ISQV LEWPTDLAINPMDNSLYVLDNNVVLQISENHQV
Sbjct: 4328 STLLGSNDLTSARPLSCDSVMEISQVRLEWPTDLAINPMDNSLYVLDNNVVLQISENHQV 4507

Query: 1423 RIVAGRPMHCQVPGIDHFLLSKVAIHATLESATALAVSHNGVLYIAETDEKKINRIRQVT 1482 

RIVAGRPMHCQVPGIDHFLLSKVAIHATLESATALAVSHNGVLYIAETDEKKINRIRQVT 
Sbjct: 4508 RIVAGRPMHCQVPGIDHFLLSKVAIHATLESATALAVSHNGVLYIAETDEKKINRIRQVT 4687 

Query: 1483 TSGEISLVAGAPSGCDCKNDANCDCFSGDDGYAKDAKLNTPSSLAVCADGELYVADLGNI 1542 

TSGEISLVAGAPSGCDCKNDANCDCFSGDDGYAKDAKLNTPSSLAVCADGELYVADLGNI 
Sbjct: 4688 TSGEISLVAGAPSGCDCKNDANCDCFSGDDGYAKDAKLNTPSSLAVCADGELYVADLGNI 4867 



Query: 1543 RIRFIRKNKPFLNTQNMYELSSPIDQELYLFDTTGKHLYTQSLPTGDYLYNFTYTGDGDI 1602 

RIRFIRKNKPFLNTQNMYELSSPIDQELYLFDT+GKHLYTQSLPTGDYLYNFTYTGDGDI

Sbjct: 4868 RIRFIRKNKPFLNTQNMYELSSPIDQELYLFDTSGKHLYTQSLPTGDYLYNFTYTGDGDI 5047

Query: 1603 TLITD1MGNMV1WRRDSTGMPLWLWPDGQVYWVTMGTNSALKSV 1662 
T ITDNNGlSnyiVWRRDSTG 

Sbjct: 5048 THITDNNGNMVNVRRDSTGMPLWLVVPDGQVYWVTMGTNSALRSVTTQGHELAMMTYHGN 5227

Query: 1663 SGLLATKSNENGWTTFYEYDSFGRLTNVTFPTGQVSSFRSDTDSSVHVQVETSSKDDVTI 1722 

SGLLATKSNENGWTTFYEYDSFGRLTNVTFPTGQVSSFRSDTDSSVHVQVETSSKDDVTI 

Sbjct: 5228 SGLLATKSNENGWTTFYEYDSFGRLTNVTFPTGQVSSFRSDTDSSVHVQVETSSKDDVTI 5407 

Query: 1723 TTNLSASGAFYTLLQDQVRNSYYIGADGSLRLLLANGMEVALQTEPHLLAGTVNPTVGKR 1782 

TTNLSASGAFYTLLQDQVRNSYYIGADGSLRLLLANGMEVALQTEPHLLAGTVNPTVGKR 

Sbjct: 5408 TTNLSASGAFYTLLQDQVRNSYYIGADGSLRLLLANGMEVALQTEPHLLAGTVNPTVGKR 5587 

Query: 1783 NVTLPIDNGLNLVEWRQRKEQARGQVTVFGRRLRVHNRNLLSLDFDRVTRTEKIYDDHRK 1842 

NVTLPIDNGLNLVEWRQRKEQARGQVTVFGRRLRVHNRNLLSLDFDRVTRTEKIYDDHRK 

Sbjct: 5588 NVTLPIDNGLNLVEWRQRKEQARGQVTVFGRRLRVHNRNLLSLDFDRVTRTEKIYDDHRK 5767 

Query: 1843 FTLRILYDQAGRPSLWSPSSRLNGVNVTYSPGGYIAGIQRGIMSERMEYDQAGRITSRIF 1902 

FTLRILYDQAGRPSLWSPSSRLNGVNVTYSPGG+IAGIQRGIMSERMEYDQAGRITSRIF

Sbjct: 5768 FTLRILYDQAGRPSLWSPSSRLNGVNVTYSPGGHIAGIQRGIMSERMEYDQAGRITSRIF 5947 

Query: 1903 ADGKTWSYTYLEKSMVLLLHSQRQYIFEFDKNDRLSSVTMPNVARQTLETIRSVGYYRNI 1962 

ADGK WSYTYLEKSMVL LHSQRQYIFEFDKNDRLSSVTMPNVARQTLETIRSVGYYRNI 

Sbjct: 5948 ADGKMWSYTYLEKSMVLHLHSQRQYIFEFDKNDRLSSVTMPNVARQTLETIRSVGYYRNI 6127 

Query: 1963 YQPPEGNASVIQDFTEDGHLLHTFYLGTGRRVIYKYGKLSKLAETLYDTTKVSFTYDETA 2022 

YQPPEGNASVIQDFTEDGHLLHTFYLGTGRRVIYKYGKLSKLAETLYDTTKVSFTYDETA 

Sbjct: 6128 YQPPEGNASVIQDFTEDGHLLHTFYLGTGRRVIYKYGKLSKLAETLYDTTKVSFTYDETA 6307 

Query: 2023 GMLKTINLQNEGFTCTIRYRQIGPLIDRQIFRFTEEGMVNARFDYNYDNSFRVTSMQAVI 2082

GMLKT+NLQNEGFTCTIRYRQIGPLIDRQIFRFTEEGMVNARFDYNYDNSFRVTSMQAVI 

Sbjct: 6308 GMLKTVNLQNEGFTCTIRYRQIGPLIDRQIFRFTEEGMVNARFDYNYDNSFRVTSMQAVI 6487 

Query: 2083 NETPLPIDLYRYDDVSGKTEQFGKFGVIYYDINQIITTAVMTHTKHFDAYGRMKEVQYEI 2142 

NETPLPIDLYRYDDVSGKTEQFGKFGVIYYDINQIITTAVMTHTKHFDAYGRMKEVQYEI

Sbjct: 6488 NETPLPIDLYRYDDVSGKTEQFGKFGVIYYDINQIITTAVMTHTKHFDAYGRMKEVQYEI 6667 

Query: 2143 FRSLMYWMTVQYDNMGRVVKKELKVGPYANTTRYSYEYDADGQLQTVSINDKPLWRYSYD 2202

FRSLMYWMTVQYDNMGRVVKKELKVGPYANTTRYSYEYDADGQLQTVSINDKPLWRYSYD

Sbjct: 6668 FRSLMYWMTVQYDNMGRVVKKELKVGPYANTTRYSYEYDADGQLQTVSINDKPLWRYSYD 6847

Query: 2203 LNGNLHLLSPGNSARLTPLRYDIRDRITRLGDVQYKMDEDGFLRQRGGDIFEYNSAGLLI 2262 

LNGNLHLLSPGNSARLTPLRYD+RDRITRLGDVQYKMDEDGFLRQRGGD+FEYNSAGLLI 

Sbjct: 6848 LNGNLHLLSPGNSARLTPLRYDLRDRITRLGDVQYKMDEDGFLRQRGGDVFEYNSAGLLI 7027 

Query: 2263 KAYNRAGSWSVRYRYDGLGRRVSSKSSHSHHLQFFYADLTNPTKVTHLYNHSSSEITSLY 2322 

KAYNRA WSVRYRYDGLGRRVSSKSSHSHHLQFFYADLTNPTKVTHLYNHSSSEITSLY 

Sbjct: 7028 KAYNRASGWSVRYRYDGLGRRVSSKSSHSHHLQFFYADLTNPTKVTHLYNHSSSEITSLY 7207 

Query: 2323 YDLQGHLFAMELSSGDEFYIACDNIGTPLAVFSGTGLMIKQILYTAYGEIYMDTNPNFQI 2382 

YDLQGHLFAMELSSGDEFYIACDNIGTPLAVFSGTGLMIKQILYTAYGEIYMDTNPNFQI 

Sbjct: 7208 YDLQGHLFAMELSSGDEFYIACDNIGTPLAVFSGTGLMIKQILYTAYGEIYMDTNPNFQI 7387 



Query: 2383 IIGYHGGLYDPLTKLVHMGRRDYDVLAGRWTSPDH 2442 

IIGYHGGLYDPLTKLVHMGRRDYDVLAGRWTSPDHELWK LSS++++PF+LYMFKNNNPI
Sbjct: 7388 IIGYHGGLYDPLTKLVHMGRRDYDVLAGRWTSPDHELWKRLSSNSIVPFHLYMFKNNNPI 7567 

Query: 2443 SNSQDIKCFMTDVNSWLLTFGFQLHNVIPGYPKPDMDAMEPSYELIHTQMKTQEWDNSKS 2502 

SNSQDIKCFMTDVNSWLLTFGFQLHNVIPGYPKPD DAMEPSYEL+HTQMKTQEWDNSKS
Sbjct: 7568 SNSQDIKCFMTDVNSWLLTFGFQLHNVIPGYPKPDTDAMEPSYELVHTQMKTQEWDNSKS 7747 

Query: 2503 ILGVQCEVQKQLKAFVTLERFDQLYGSTITSCQQAPKTKKFASSGSVFGKGVKFALKDGR 2562 

ILGVQCEVQKQLKAFVTLERFDQLYGSTITSCQQAP+TKKFASSGS+FGKGVKFALKDGR 
Sbjct: 7748 ILGVQCEVQKQLKAFVTLERFDQLYGSTITSCQQAPETKKFASSGSIFGKGVKFALKDGR 7927 

Query: 2563 VTTDIISVANEDGRRVAAILNHAHYLENLHFTIDGVDTHYFVKPGPSEGDLAILGLSGGR 2622

VTTDIISVANEDGRR+AAILN+AHYLENLHFTIDGVDTHYFVKPGPSEGDLAILGLSGGR
Sbjct: 7928 VTTDIISVANEDGRRIAAILNNAHYLENLHFTIDGVDTHYFVKPGPSEGDLAILGLSGGR 8107

Query: 2623 RTLENGVNVTVSQINTVLNGRTRRYTDIQLQYGALCLNTRYGTTLDEEKARVLELARQRA 2682 

RTLENGVNVTVSQINT+L+GRTRRYTDIQLQY ALCLNTRYGTT+DEEK RVLELARQRA
Sbjct: 8108 RTLENGVNVTVSQINTMLSGRTRRYTDIQLQYRALCLNTRYGTTVDEEKVRVLELARQRA 8287 

Query: 2683 VRQAWAREQQRLREGEEGLRAWTEGEKQQVLSTGRVQGYDGFFVISVEQYPELSDSANNI 2742 

VRQAWAREQQRLREGEEGLRAWT+GEKQQVL+TGRVQGYDGFFV SVEQYPELSDSANNI 
Sbjct: 8288 VRQAWAREQQRLREGEEGLRAWTDGEKQQVLNTGRVQGYDGFFVTSVEQYPELSDSANNI 8467 

Query: 2743 HFMRQSEMGRR 2753 

HFMRQSEMGRR 
Sbjct: 8468 HFMRQSEMGRR 8500 
LOCUS       AB025413     8585 bp    mRNA    linear   ROD 08-MAY-1999
DEFINITION  Mus musculus mRNA for Ten-m4, complete cds.
ACCESSION   AB025413
VERSION     AB025413.1  GI:4760781
KEYWORDS    Ten-m4.
SOURCE      Mus musculus (house mouse)
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE   1  (sites)
  AUTHORS   Oohashi,T., Zhou,X., Feng,K., Richter,B., Moergelin,M., Perez,M.T.,
            Su,W., Chiquet-Ehrismann,R., Rauch,U. and Faessler,R.
  TITLE     Mouse Ten-m/Odz is a new family of dimeric type II transmembrane
            proteins expressed in many tissues
  JOURNAL   J. Cell Biol. (1999) In press
REFERENCE   2  (bases 1 to 8585)
  AUTHORS   Oohashi,T.
  TITLE     Direct Submission
  JOURNAL   Submitted (29-MAR-1999) Toshitaka Oohashi, Okayama University
            Medical School, Dept. Mol. Biol. & Biochem.; 2-5-1 Shikata-cho,
            Okayama, Okayama 700-8252, Japan
            (E-mail:oohashi@cc.okayama-u.ac.jp, Tel:+81-86-235-7128,
            Fax:+81-86-222-7768)

FEATURES             Location/Qualifiers
     source          1..8585
                     /organism="Mus musculus"
                     /mol_type="mRNA"
                     /strain="Balb/c"
                     /db_xref="taxon:10090"
                     /tissue_type="brain"
                     /dev_stage="adult"
     gene            1..8585
                     /gene="ten-m4"
     CDS             188..8503
                     /gene="ten-m4"
                     /codon_start=1
                     /product="Ten-m4"
                     /protein_id="BAA77399.1"
                     /db_xref="GI:4760782"
/translation="MDVKERKPYRSLTRRRDAERRYTSSSADSEEGKGPQKSYSSSET
LKAYDQDARLAYGSRVKDMVPQEAEEFCRTGTNFTLRELGLGEMTPPHGTLYRTDIGL
PHCGYSMGASSDADLEADTVLSPEHPVRLWGRSTRSGRSSCLSSRANSNLTLTDTEHE 
NTETDHPSSLQNHPRLRTPPPPLPHAHTPNQHHAASINSLNRGNFTPRSNPSPAPTDH 
SLSGEPPAGSAQEPTHAQDNWLLNSNIPLETRNLGKQPFLGTLQDNLIEMDILSASRH 
DGAYSDGHFLFKPGGTSPLFCTTSPGYPLTSSTVYSPPPRPLPRSTFSRPAFNLKKPS 
KYCNWKCAALSAILISATLVILLAYFVAMHLFGLNWHLQPMEGQMQMYEITEDTASSW 
PVPTDVSLYPSGGTGLETPDRKGKGAAEGKPSSLFPEDSFIDSGEIDVGRRASQKIPP 



http://www.ncbi.nlm.nih.gov/entrez/query.fcgi ?cmd=Retrieve&db=nucleotide&list_uids=47607J 10/8/2003 
GTFWRSQVFIDHPVHLKFNVSLGKAALVGIYGRKGLPPSHTQFDFVELLDGRRLLTQE
ARSLEGPQRQSRGPVPPSSHETGFIQYLDSGIWHLAFYNDGKESEVVSFLTTAIESVD
NCPSNCYGNGDCISGTCHCFLGFLGPDCGRASCPVLCSGNGQYMKGRCLCHSGWKGAE 
CDVPTNQCIDVACSSHGTCIMGTCICNPGYKGESCEEVDCMDPTCSSRGVCVRGECHC
SVGWGGTNCETPRATCLDQCSGHGTFLPDTGLCNCDPSWTGHDCSIEICAADCGGHGV 
CVGGTCRCEDGWMGAACDQRACHPRCAEHGTCRDGKCECSPGWNGEHCTIAHYLDRVV
KEGCPGLCNGNGRCTLDLNGWHCVCQLGWRGTGCDTSMETGCGDGKDNDGDGLVDCMD 
PDCCLQPLCHVNPLCLGSPDPLDIIQETQAPVSQQNLNPFYDRIKFLVGRDSTHSIPG 
ENPFDGGHACVIRGQVMTSDGTPLVGVNISFINNPLFGYTISRQDGSFDLVTNGGISI 
ILRFERAPFITQEHTLWLPWDRFFVMETIVMRHEENEIPSCDLSNFARPNPVVSPSPL
TSFASSCAEKGPIVPEIQALQEEIVIAGCKMRLSYLSSRTPGYKSVLRISLTHPTIPF 
NLMKVHLMVAVEGRLFRKWFAAAPDLSYYFIWDKTDVYNQKVFGFSEAFVSVGYEYES 
CPDLILWEKRTAVLQGYEIDASKLGGWSLDKHHALNIQSGILHKGNGENQFVSQQPPV 
IGSIMGNGRRRSISCPSCNGLADGNKLLAPVALTCGSDGSLYVGDFNYIRRIFPSGNV
TNILEMRNKDFRHSHSPAHKYYLATDPMSGAVFLSDTNSRRVFKVKSTTVVKDLVKNS
EVVAGTGDQCLPFDDTRCGDGGKATEATLTNPRGITVDKFGLIYFVDGTMIRRVDQNG
IISTLLGSNDLTSARPLSCDSVMEISQVRLEWPTDLAINPMDNSLYVLDNNVVLQISE 
NHQVRIVAGRPMHCQVPGIDHFLLSKVAIHATLESATALAVSHNGVLYIAETDEKKIN 
RIRQVTTSGEISLVAGAPSGCDCKNDANCDCFSGDDGYAKDAKLNTPSSLAVCADGEL 
YVADLGNIRIRFIRKNKPFLNTQNMYELSSPIDQELYLFDTSGKHLYTQSLPTGDYLY
NFTYTGDGDITHITDNNGNMVNVRRDSTGMPLWLVVPDGQVYWVTMGTNSALRSVTTQ
GHELAMMTYHGNSGLLATKSNENGWTTFYEYDSFGRLTNVTFPTGQVSSFRSDTDSSV 
HVQVETSSKDDVTITTNLSASGAFYTLLQDQVRNSYYIGADGSLRLLLANGMEVALQT
EPHLLAGTVNPTVGKRNVTLPIDNGLNLVEWRQRKEQARGQVTVFGRRLRVHNRNLLS
LDFDRVTRTEKIYDDHRKFTLRILYDQAGRPSLWSPSSRLNGVNVTYSPGGHIAGIQR
GIMSERMEYDQAGRITSRIFADGKMWSYTYLEKSMVLHLHSQRQYIFEFDKNDRLSSV 
TMPNVARQTLETIRSVGYYRNIYQPPEGNASVIQDFTEDGHLLHTFYLGTGRRVIYKY 
GKLSKLAETLYDTTKVSFTYDETAGMLKTVNLQNEGFTCTIRYRQIGPLIDRQIFRFT
EEGMVNARFDYNYDNSFRVTSMQAVINETPLPIDLYRYDDVSGKTEQFGKFGVIYYDI 
NQIITTAVMTHTKHFDAYGRMKEVQYEIFRSLMYWMTVQYDNMGRVVKKELKVGPYAN
TTRYSYEYDADGQLQTVSINDKPLWRYSYDLNGNLHLLSPGNSARLTPLRYDLRDRIT 
RLGDVQYKMDEDGFLRQRGGDVFEYNSAGLLIKAYNRASGWSVRYRYDGLGRRVSSKS 
SHSHHLQFFYADLTNPTKVTHLYNHSSSEITSLYYDLQGHLFAMELSSGDEFYIACDN 
IGTPLAVFSGTGLMIKQILYTAYGEIYMDTNPNFQIIIGYHGGLYDPLTKLVHMGRRD
YDVLAGRWTSPDHELWKRLSSNSIVPFHLYMFKNNNPISNSQDIKCFMTDVNSWLLTF 
GFQLHNVIPGYPKPDTDAMEPSYELVHTQMKTQEWDNSKSILGVQCEVQKQLKAFVTL
ERFDQLYGSTITSCQQAPETKKFASSGSIFGKGVKFALKDGRVTTDIISVANEDGRRI 
AAILNNAHYLENLHFTIDGVDTHYFVKPGPSEGDLAILGLSGGRRTLENGVNVTVSQI 
NTMLSGRTRRYTDIQLQYRALCLNTRYGTTVDEEKVRVLELARQRAVRQAWAREQQRL 
REGEEGLRAWTDGEKQQVLNTGRVQGYDGFFVTSVEQYPELSDSANNIHFMRQSEMGR 
R" 

BASE COUNT 2080 a 2478 c 2295 g 1732 t 

ORIGIN 

1 ctacctctga aattcagcaa tacccttgaa ctgaagttga aaaaggagaa ctggggtccc 
61 tggagagggg gctgcagggc ccgaggacac caactcttcc cagagcgctg gcatggagcc 
121 agaccactcg gccctgagtg cggcaagggc ccaattcgtg gatgtggaag agcgagagcc 
181 agaggccatg gacgtgaagg agaggaagcc ctatcgctcg ctgacccggc gtcgcgatgc 
241 tgagcgccgc tacaccagct catcggcaga cagcgaggag ggcaagggcc cacagaagtc 
301 ctacagctcc agtgagaccc tgaaggcata tgaccaagat gctcgcctag cctacggcag 
361 ccgcgtcaag gacatggtac cacaggaggc cgaggagttc tgccgcacag gcactaattt 
421 cactcttcgt gaactgggac tgggagagat gacgccccct catgggactc tctacaggac 
481 agacatcggc ctcccacact gtggctattc catgggggcc agctctgatg cagacttgga 
541 agcagacact gtgctatccc ctgaacaccc agtgcgcctg tggggccgga gcacacggtc 
601 agggcgcagt tcctgcctgt ccagccgggc caactccaac ctcacactca cggacacaga 
661 gcatgagaac acagaaaccg atcatcccag cagcctgcag aaccaccctc ggctccggac 
721 gcctcctcca ccactgcccc acgcccatac ccccaaccag caccacgcgg cctccatcaa 
781 ctccttgaac aggggcaact tcaccccaag gagcaacccc agcccagcac ccacagacca 
841 ctcactctct ggggagcccc cagcaggcag cgcccaggag ccaacccatg cccaggacaa 
901 ctggctactc aacagtaaca tcccactgga aaccagaaac ctaggcaagc agccattcct 
961 agggacattg caggacaacc tcattgagat ggacattctc agcgcctccc gccatgatgg 
1021 ggcttacagt gacgggcact tcctcttcaa gcccggaggc acctccccgc tcttctgcac

1081 cacatcccca gggtaccccc taacgtctag caccgtgtac tcgcccccac cccggcccct 

1141 gccccgcagc accttctccc gaccagcctt taacctcaag aagccttcca agtactgcaa 

1201 ctggaagtgt gcagcgctga gcgccatcct catctcagct acgctcgtca tcctgctcgc 

1261 atactttgtg gccatgcacc tgtttggcct aaactggcac ctgcagccga tggaggggca 

1321 gatgcagatg tatgagatca cggaggacac agccagcagc tggcctgtgc caacggacgt 

1381 ctccctgtat ccctcagggg gcactggctt agagacccct gacaggaaag gcaaaggagc 

1441 cgcagaagga aagccaagta gtttatttcc agaggacagt tttatagatt ctggagaaat 

1501 cgatgtgggg agacgggctt cccagaagat tccgcctgga acgttctgga gatctcaggt 

1561 gttcatagac caccctgtac acctgaagtt caatgtgtct ctggggaaag cagctctggt 

1621 tggcatttat ggcagaaaag gccttcctcc ttcccatact cagtttgact ttgtggagct 

1681 cctggatgga agaaggctcc taacccaaga ggcaaggagc ctggagggtc ctcagcgcca 

1741 atcacggggc cctgttcccc catccagcca tgagaccggt ttcatccagt atctggattc 

1801 aggaatctgg cacctggcct tttacaatga tggaaaagaa tctgaagtgg tttcatttct 

1861 caccaccgcc attgaatcag tggataactg ccccagcaac tgctatggta acggtgactg 

1921 catttccggg acctgccact gcttcctggg ttttctgggc cctgactgtg gccgagcatc 

1981 ctgtcccgtg ctgtgcagtg ggaacggcca gtacatgaag ggcaggtgcc tgtgccacag 

2041 tggctggaag ggtgccgaat gtgacgtgcc taccaaccag tgtatcgacg tggcctgcag 

2101 cagtcatggc acctgcatca tgggcacctg catctgcaac cccggctaca agggcgagag 

2161 ctgcgaggaa gtggactgca tggaccccac atgttccagc aggggtgtct gtgtcagagg 

2221 cgagtgccac tgctctgtgg gatggggagg caccaactgc gagacaccca gggccacatg 

2281 cttggaccag tgctcaggcc acggaacttt cctcccagac acaggacttt gcaactgtga 

2341 cccaagctgg actggacacg actgttccat tgagatctgt gccgctgact gtggtggcca 

2401 tggcgtctgc gtaggaggca cctgccgctg tgaggatggt tggatggggg ccgcatgcga 

2461 ccaacgggcc tgccacccac gctgtgcaga acacgggacc tgccgggacg gcaagtgcga 

2521 atgcagcccc ggctggaatg gagagcactg caccatcgct cactatctgg atagggtagt 

2581 taaagagggc tgtcctggct tgtgcaatgg aaatggcaga tgtaccctgg acctgaatgg 

2641 gtggcactgt gtctgccagc tgggctggcg agggactggc tgcgacacat ccatggaaac 

2701 gggctgtgga gacggcaagg acaacgacgg agatggcttg gtggactgca tggaccctga 

2761 ctgctgcctc cagcctctct gtcatgtcaa cccgctgtgc ctaggctccc ctgaccccct 

2821 ggacatcatc caagagacac aggcccctgt atcccagcag aacctgaacc ccttctacga 

2881 tcgaatcaag ttcctcgtgg gcagggacag cacacacagc atccccgggg agaacccctt 

2941 tgacggaggg catgcttgtg tcatccgtgg acaagtgatg acgtcagatg ggaccccatt 

3001 ggttggcgtg aacatcagtt tcatcaataa ccctctcttt ggatacacaa tcagcaggca 

3061 agatggcagc tttgacctgg tcacaaatgg cggcatctcc atcatcttga gattcgaaag 

3121 ggcacccttc atcacacagg agcataccct ctggctaccc tgggatcgct tctttgtcat 

3181 ggaaaccatc gtcatgagac acgaggagaa tgagatcccc agctgtgacc tgagcaactt 

3241 tgcccgtccc aaccctgtgg tctctccatc cccactgaca tcattcgcca gctcctgtgc 

3301 tgagaaaggc cccattgtcc cagaaatcca ggccctgcaa gaggaaatcg tcatcgctgg

3361 ctgcaagatg aggttgagct acctgagcag ccgcactcct ggctataagt ctgtcctgag

3421 gatcagtctc acacacccca ccatcccctt caacctcatg aaggtccacc tcatggtggc 

3481 agtcgagggt cggctcttca ggaagtggtt tgccgcagct cctgacctgt cctattactt 

3541 catctgggac aagacagatg tctacaacca gaaggtgttt ggattctcag aagcctttgt 

3601 ttccgtgggt tatgagtatg aatcctgccc agatctgatc ctgtgggaga aaaggacagc

3661 ggtgctgcag ggctatgaaa ttgatgcctc caagctgggg ggctggagtc tggataagca 

3721 ccatgccctg aacatccaga gtggcatcct gcacaaaggg aatggagaga accagtttgt 

3781 gtcccagcag ccaccggtca tcggcagcat catgggcaat gggcgcagga gaagtatctc 

3841 ctgccccagc tgcaatggtc ttgctgatgg caacaagctc ctggcccccg tggccctcac 

3901 ctgcggctct gatgggagtc tctacgtggg agacttcaac tacatccgca gaatcttccc 

3961 ctctggaaat gtcaccaaca tcctggagat gagaaataaa gatttcagac atagtcacag 

4021 cccagcacac aaatactacc tggctacaga ccccatgagt ggggccgtct tcctgtctga 

4081 caccaacagc cggcgggtct tcaaggtcaa gtccaccaca gtggtgaagg acctggtgaa 

4141 gaactccgag gtggtagcag ggactggtga ccagtgcctc ccctttgatg atacccgctg 

4201 cggagatggt gggaaagcca cagaagccac gctcactaac cccaggggaa ttacagtgga

4261 caagtttggg ctcatttatt tcgtggacgg caccatgatc agacgtgttg atcaaaatgg

4321 aatcatctcc actttgctgg gctccaatga cctcacctcg gccaggcccc tcagctgtga 

4381 ctccgtcatg gagatttctc aggttcgcct ggagtggccc acagacttag ccatcaaccc 

4441 aatggataat tctctctatg tcctcgacaa caatgtggtc ctgcaaatct ccgaaaacca 

4501 ccaggtacgc attgtcgctg ggaggcccat gcactgccag gtccctggca tcgaccactt 

4561 cctgctgagc aaggtagcca tccatgcgac cctggagtca gccactgctc tggccgtttc 
4621 acacaatgga gtcttgtaca tcgctgagac cgacgagaaa aagatcaatc gcatcaggca
4681 ggtcactaca agtggtgaga tctcactggt tgctggtgcc cccagtggct gtgactgtaa
4741 aaatgatgcc aactgtgact gcttctctgg agatgatggt tacgccaagg atgcaaagct
4801 gaatacccca tcgtccttgg ctgtgtgtgc tgacggggag ctctatgtgg ccgacctggg
4861 aaacatccga attcgattta tccggaagaa caagcctttc ctgaacactc agaacatgta
4921 cgagctatcc tcccccatcg accaggagct gtacctcttt gataccagtg gcaagcatct
4981 gtacactcag agcctaccca caggggacta cctgtacaac ttcacttaca caggggacgg
5041 ggacatcaca catatcaccg acaacaatgg caacatggtg aacgtccgcc gagactctac
5101 cgggatgcct ctctggctgg tagtcccaga tggccaggta tactgggtaa ccatgggcac
5161 caacagcgca ctcagaagtg tgaccacaca aggacacgag ctagccatga tgacctacca
5221 tggcaactct ggcctcttgg caaccaaaag caatgaaaac gggtggacaa cgttttatga
5281 gtatgacagt tttggtcgcc tgacaaacgt gacctttcca actggccagg tgagcagttt
5341 ccgaagcgat acagacagct cagtgcacgt gcaggtagag acctcaagca aagatgacgt
5401 caccataacc accaacctgt ctgcttcggg tgccttctac accctgttac aagaccaagt
5461 ccggaacagc tactacatcg gggctgatgg ctccctgagg ctgctactag ccaatggcat
5521 ggaagtggct ctgcagactg agccacacct gctggctggc actgtcaacc ccactgtagg
5581 caagaggaat gtcacactgc ccattgataa tggcctcaac ctggtggagt ggcggcagcg
5641 caaggagcag gctcgtggcc aggtcaccgt ctttggacgc cgtctgcggg ttcacaaccg
5701 aaacctcttg tctttggact ttgaccgtgt cacacgcaca gagaagatct acgatgacca
5761 tcgcaagttc acccttcgga tcctatatga ccaggcaggg aggcccagcc tctggtcacc
5821 tagcagcagg ctgaatggtg ttaatgtgac ctactcccct ggaggtcaca ttgctggaat
5881 ccaaaggggc atcatgtctg agagaatgga atatgatcag gcagggcgca tcacatcccg
5941 gatctttgca gacgggaaaa tgtggagcta cacgtactta gagaagtcca tggtgcttca
6001 tctccacagc cagaggcagt acatcttcga gtttgacaag aatgaccgcc tctcttctgt
6061 gaccatgccc aacgtagccc ggcagacgct ggagaccatc cgctcagtgg gctactacag
6121 gaacatctac cagcccccgg aaggcaacgc ctcagtcatt caggacttca ctgaggatgg
6181 acacctgtta cataccttct acttgggcac cggccgccgg gtgatttaca agtatggcaa
6241 gttgtcaaag ctggccgaga ctctgtatga caccactaag gtaagcttca cctacgacga
6301 gaccgcaggg atgctgaaga ctgtcaacct acagaatgag ggcttcacct gcactatccg
6361 ctaccgtcag attgggcccc tgattgatcg gcagatcttc cgtttcacgg aagaaggcat
6421 ggtcaatgcc cgttttgatt acaactatga caacagtttc cgtgtgacta gcatgcaggc
6481 tgtgatcaat gagaccccac tgcccattga cctctaccgc tatgatgatg tgtcagggaa
6541 gacagagcag tttgggaagt ttggtgtcat ctactacgac atcaaccaga tcattaccac
6601 agccgtcatg acccacacca agcactttga tgcttatggc aggatgaagg aagtacagta
6661 tgagattttc cggtcactca tgtactggat gactgttcag tatgataaca tgggacgggt
6721 agtgaagaag gagctgaagg tgggacccta tgccaacact acccgctact cctatgagta
6781 tgatgctgat ggccagctgc agacagtctc catcaatgac aagccactct ggcgctacag
6841 ctatgacctc aatgggaacc tacacttgct gagccctggg aacagcgcac ggctcacacc
6901 actacggtat gacctccgtg accgcatcac taggctgggt gatgtacagt acaagatgga
6961 tgaggatggc ttcctgaggc agcggggtgg ggatgtcttc gagtacaatt cagccgggct
7021 gctcatcaaa gcctacaacc gggctagtgg gtggagtgtc aggtaccgat acgatgggct
7081 gggacgccga gtatccagca agagcagcca cagccaccac ctgcagttct tctacgcaga
7141 cctgaccaac cccaccaagg tcacccacct ctacaaccac tccagctctg agattacatc
7201 cctctactat gatctgcaag gacacctctt tgccatggag ctgagcagtg gcgatgagtt
7261 ttacatagct tgtgacaaca tcgggacccc gcttgctgtc ttcagtggaa ctggcttgat
7321 gattaagcag atcctataca cggcctatgg ggagatctac atggacacaa accccaactt
7381 ccagatcatc atcggctacc acggcggcct ctatgatcca ctcaccaagc ttgtccacat
7441 gggccgacgg gattatgatg tgctggctgg acgctggaca agcccagacc atgaactctg
7501 gaaacgcctg agtagcaaca gcatcgtgcc ttttcatctc tacatgttta agaacaacaa
7561 ccccatcagc aactctcagg acatcaagtg cttcatgaca gatgtcaaca gctggctcct
7621 cacctttgga ttccagctgc acaacgtgat acctggctat cccaagccag acacagatgc
7681 catggaaccc tcctacgagc tcgtacacac acagatgaaa actcaggaat gggacaacag
7741 caagtctatc ctcggggtac agtgtgaagt tcagaagcaa ctcaaggctt tcgttacctt
7801 agaacgcttt gaccagctct acggctcgac catcaccagc tgccaacagg cccctgagac
7861 aaagaagttt gcctccagtg gttccatctt tggcaagggg gtcaagtttg ccttgaaaga
7921 tggtcgagtg accactgaca tcatcagtgt ggccaatgag gatgggcgga ggatcgcagc
7981 catcttgaac aatgctcact atctagagaa ccttcacttc accattgatg gggtggacac
8041 ccactacttt gtgaaaccag gaccttcgga aggtgacctg gccatcctgg gcctcagtgg
8101 ggggcggaga accctggaga atggggtcaa cgtcactgtg tcccagatca acaccatgct
8161 cagtggcagg actagacgct acactgacat ccagctgcag tacagggcgc tgtgcctgaa
8221 cacccgctac gggacaacag tggatgagga aaaggtgcgg gtgctggagc tggccaggca
8281 gagagctgtg cgccaggctt gggcccgaga gcagcagaga ctgcgggaag gggaagaggg
8341 cctgcgggcc tggacagatg gggaaaagca gcaggtgctg aacacggggc gggtgcaagg
8401 ctacgacggc ttctttgtga cctcggtcga gcagtaccca gaactgtcag acagcgccaa
8461 caatatccac ttcatgagac agagcgagat gggccgaagg tgacagaaag ggccaaggcc
8521 tggacttctt gccaaagaca gctactcttc tgtggccgta ttcctgactg tgttgtactt
8581 aacaa